Method for minimizing scintillation in dynamic images

ABSTRACT

A method for reducing scintillation in an image of a scene includes receiving an input sequence of images of the scene and grouping a first plurality of images of the sequence of images into a first subset of images comprising a first number of images that occur in sequence within the input sequence of images of the scene. The method also includes grouping a second plurality of images of the sequence of images into a second subset of images comprising a second number of images that occur in sequence within the input sequence of images of the scene. The method further includes generating a set of averaged images comprising an averaged image for the first subset of images and an averaged image for the second subset of image and outputting a composite image based at least the set of averaged images.

CROSS-REFERENCES TO RELATED APPLICATIONS

This application is a continuation of U.S. patent application Ser. No. 12/130,294, filed on May 30, 2008, now U.S. Pat. No. 8,073,199, the disclosure of which is hereby incorporated by reference in its entirety for all purposes.

FIELD OF THE INVENTION

The present invention relates, generally, to image processing and, more particularly, to a method for minimizing scintillation in dynamic images.

BACKGROUND OF THE INVENTION

Optical scintillation is typically caused by atmospheric turbulence or other atmospheric non-homogeneities, such as snow, rain, smoke, fog, underwater fluctuations or rising of pavement heat, which distorts an optical flow of a scene or an object being photographed. This optical distortion causes degradation in the quality of images taken by optical systems, used to form and/or record images.

In addition to atmospheric turbulences, when optical systems obtain images based on image data that have passed through a turbulent medium, the obtained images can also be distorted by the components of the optical systems. For example, the image of an object viewed with a telescope or other long-range imaging system may be distorted by mechanical, thermal, and optical limitations of the instrument.

Several approaches or methods have been used to mitigate or eliminate the effects of image distortion due to atmospheric turbulence. One approach relied on obtaining corrective information within the wavelength regime(s) in which imaging data is desired. For example, visible image data are used to correct visible images, and infrared image data are used to correct infrared images. However, this technique may be prohibitively expensive and impractical due to the additional complexity imposed on the optical systems.

Another approach, used in the area of astronomy, involve adaptive optics to correct each frame by sensing the wavefront distortion induced by the turbulence and providing a servo-controlled phase screen, often referred to as a rubber-mirror. However, imaging systems using adaptive optics to correct for atmospheric turbulence are complex and expensive.

Therefore, there is a need to remedy the problems noted above and others previously experienced for minimizing scintillation in dynamic images while retaining temporal object motion.

SUMMARY OF THE INVENTION

The foregoing problems are solved and a technical advance is achieved by methods, systems and articles of manufacture consistent with the present invention, which provide scintillation removal or minimization in an image of a scene or object.

In accordance with methods consistent with the present invention, a method for minimizing scintillation in an image of a scene or object is provided. The method comprises collecting a plurality of images, dividing the plurality of images into subsets of images, averaging each of the subsets of images to produce corresponding averaged images, differencing the averaged images, masking the difference images, and combining the masked images to produce a composite image.

In accordance with systems consistent with the present invention, an image processing system is provided. The system comprises a memory and a processing unit coupled to the memory wherein the processing unit is configured to execute the above noted method steps.

In accordance with articles of manufacture consistent with the present invention, there is provided a computer-readable medium containing a program adapted to cause a data processing system to execute the above-noted method steps. In this regard, the computer-readable medium may be a computer-readable medium, such as solid-state memory, magnetic memory such as a magnetic disk, optical memory such as an optical disk, or a computer-readable transmission medium, such as a modulated wave (such as radio frequency, audio frequency or optical frequency modulated waves) or a modulated downloadable bit stream that can be received by a computer via a network or a via a wireless connection.

Other systems, methods, features, and advantages of the present invention will be or will become apparent to one with skill in the art upon examination of the following figures and detailed description. It is intended that all such additional systems, methods, features, and advantages be included within this description, be within the scope of the invention, and be protected by the accompanying claims.

BRIEF DESCRIPTION OF THE DRAWINGS

The accompanying drawings, which are incorporated in and constitute a part of this specification, illustrate an implementation of the present invention and, together with the description, serve to explain the advantages and principles of the invention. In the drawings:

FIG. 1 is a schematic diagram illustrating an optical flow of an object traversing an atmospheric turbulence and captured by an optical system in accordance with the present invention;

FIG. 2A is a block diagram illustrating one embodiment of an imaging system for removing scintillation from captured dynamic images in accordance with the present invention;

FIG. 2B is a block diagram illustrating the data acquisition and processing unit of FIG. 2A;

FIG. 3 is a block diagram illustrating a process for removing scintillation from dynamic images performed by the imaging system of FIG. 2 in accordance with the present invention;

FIG. 4 is a flow diagram of the process for removing scintillation from dynamic images performed by the imaging system of FIG. 2 in accordance with the present invention;

FIG. 5 is a schematic diagram illustrating the averaging of image sequences and the differencing of the resulting averaged image sequences in accordance with the present invention;

FIGS. 6A-6E illustrate an exemplary sequence of averaged, differenced and masked sets of images produced during the process of minimizing scintillation and restoring moving objects in the final composite image in accordance with the present invention; and

FIG. 7 is a flow diagram illustrating an exemplary embodiment of a local area process (LAP) performed by the image acquisition and processing unit of FIG. 2 in accordance with the present invention.

DETAILED DESCRIPTION OF THE INVENTION

Reference will now be made in detail to an implementation consistent with the present invention as illustrated in the accompanying drawings. Wherever possible, the same reference numbers will be used throughout the drawings and the following description to refer to the same or like parts. As would be understood to one of ordinary skill in the art, certain components or elements for enhancing interpretability of laser illuminated images are not shown in the figures or specifically noted herein to avoid obscuring the invention.

Referring to FIG. 1 is a schematic diagram 100 illustrating an optical flow of an object 102 traversing an atmospheric turbulence 104 and captured by a focal array 108 via a lens 106 in accordance with the present invention.

The focal array 108 receives light representing the object 102, and captures image data signals representing the degraded image of the object 102. The light received can be distorted by atmospheric conditions (or turbulence) 104, such as rain, snow, fog, smoke, or underwater fluctuations, such that the images captured by focal array 108 are degraded or blurred. The focal array 108 is coupled to an image processing system 110 configured to process the captured dynamic images.

Now referring to FIG. 2A, an embodiment of the image processing or imaging system 110 is illustrated in accordance with the present invention. The imaging system 110 includes an image sensing or detection unit 202, an image acquisition and processing unit 204, a data recording and storage unit 206, a display unit 208, and a communication interface unit 210. Alternately, one or more of these devices, such as the display unit 208, may be incorporated into the image acquisition and processing unit 104, hereafter referred to as the processing unit 204.

The detection unit 202, hereafter referred to as the detector, comprises a focal plane array of pixels or photodetectors suitable for detecting an input image (e.g. 203 a), such as an image degraded by atmospheric turbulence, during an integration and acquisition period, which may be triggered by the image acquisition and processing unit 204. The integration and acquisition period is described in more detail below. In the implementation shown, the detector 202 detects and outputs a sequence of input images 203 a-203 n, as directed by the image acquisition and processing unit 204.

The detector 202 may include the lens 106 that focuses light emitted or reflected from the object or scene 102 onto the focal plane array 108 of pixels 212 aa-212 nn. During each integration acquisition period or frame, each of the pixels 212 aa-212 nn accumulates a respective portion of the light from the object or scene and generates a corresponding signal or charge based on the intensity of the light. The electrical signals or charges generated by the pixels 212 aa-212 nn collectively represent an input image 203 a which is output by the detector 202 to the processor 204. For purposes of illustration and clarity in the discussion, each input image 203 a is assumed to correspond to, but not limited to, 256 by 256 pixels, referenced as 212 aa-212 nn in FIG. 1.

The data acquisition and processing unit 204, hereafter referred to as the processor, can be hard-wired circuits or a processor executing a suitable set of program instructions stored on a computer readable storage medium such as a random access memory (RAM), read only memory (ROM), magnetic storage medium (such as magnetic tape, disk or diskette) or optical storage medium (such as compact disk (CD) ROM). As illustrated in FIG. 2B, the processor 204 can include a central processing unit (CPU) 212, a static memory 214 and a main memory 216 that can communicate with each other via bus 218. Additionally, the processor 204 may include an input/output device 220 that may have an alpha-numeric device, such as a keyboard, a cursor control device, such as a mouse, and a video display device, such as a liquid crystal display (LCD), an organic light emitting diode (OLED), a flat panel display, a solid state display, or a cathode ray tube (CRT).

The data recording and storage unit 206 hereafter referred to as the memory, can comprise any type of memory including random access memory (RAM), electronically erasable memory (EPROM), and the like. Further, the memory 206 can comprise any type of storage including magnetic or optical drives, a local drive or a network drive, and a floppy disk, hard drive, CD-ROM, DVD-ROM, DVD-RAM, a tape drive, and the like. The communication interface 110 can comprise any type of interface for connecting to a communication network, such as a data or voice network, a land-line or wireless network, and the like. It will be recognized that one of ordinary skill in the art would understand how to build a communication interface, and hence, further description of this interface is omitted.

Now referring to FIG. 3, a block diagram illustrating an exemplary process 300 for removing scintillation from dynamic images performed by the imaging system 110 of FIG. 2A is shown. The process 300 is represented by a plurality of functional processing units or steps, provided in an exemplary identified order, which are executed by the processor 204. The illustrative processing units include an input unit 302, an image frame averaging unit 304, an image differencing unit 306, an image warping unit 308, and an enhanced image output unit 310. The image warping unit 308 may include a local area processing unit (LAP) 309. The order of these processing units may of course be varied to the extent an output of one processing unit may not be required as an input of another processing unit. The processing units can be linked or coupled sequentially as shown, or alternatively a subset of these processing units can be run in parallel with another subset. A reduced or streamlined process might adopt fewer processing units, without impacting the intended functionality consistent with the present invention. It should be understood that various aspects of the present application may be used together as would occur to one of ordinary skill in the art without utilizing all of the processing units thereof. In addition, different combinations of these processing units might affect operational speed of the corresponding resulting processes and other intermediary processing outcomes. Accordingly, it should be apparent that other permutations of these processing units can be developed without departing from the spirit and scope of the present invention.

Initially, a set of instructions corresponding to this sequence of processing units is launched on the processor 204. This launching may be performed automatically with activation of the detector 202. The image frames 203 a-203 n may enter the imaging system 110 at the input unit 302 at real-time rates, nominally at about 30 frames, or a previously recorded suitable video frame sequence is loaded from the memory 216. The image frames 203 a-203 n form an individual temporal frame sequence, denoted as frame sequence set 1 in FIG. 5, which is updated in a “first in first out” (FIFO) manner so as to include the most recent image frame, thereby becoming the most updated sequence with each newly acquired or uploaded image frame. As the name indicates, the FIFO sequence set 1 is an image buffer queue that is configured to include a predetermined number of input image frames. That is, when this predetermined number is reached, the oldest image frame is discarded and the newest image frame is added, thereby maintaining the same predetermined number of newest image frames.

In order to greatly reduce or remove scintillation effects, typically sequential frames are averaged. As known to one of ordinary skill in the art, when frame averaging is used, the image is improved. Three types of frame averaging are typically used: integration averaging, averaging, and recursive averaging. Integration averaging applies to a still image. Recursive averaging has the advantage that it can be performed on the fly.

Now referring to FIG. 4, a flow diagram 400 illustrating the process for removing scintillation from dynamic images performed by the imaging system of FIG. 2 is shown. The scintillation removal process 300 is initiated at step 402. The frame sequence set 1, which may represent a scintillation video (SV), comprises n successive frames 103 a-103 n received from the image input unit 302, at step 404. The first image frame 103 a heads the sequence of image frames 103 a-103 n, which may be captured by the detector 102 at real time rates, nominally about 30 frames per second (fps). Alternately, as stated above the sequence of image frames 103 a-103 n may be retrieved from the main memory 216 of the processor 204 or from the memory 206 as a previously stored sequence of image frames. At step 405, the received first image frame 103 a may be communicated to and displayed by the display unit 208 if the user desires and can be stored unprocessed, if newly acquired, in the memory 206. Steps 404 and 405 correspond to the input unit 302 in FIG. 3.

At step 406, the frame sequence set 1 is subdivided into a plurality of subsets of frames, which may be contiguous and/or overlapping. Preferably, these subsets of frames do not have the same of frames, so as to produce running scene averages of various durations. Each of the plurality of subsets of frames is then averaged to form a series of average image sequences, at step 408. For example, the frame sequence set 1 is averaged to form a first average image sequence, denoted AIS0 or Avg0, as shown in FIG. 5. A second average image sequence Avg1 may be formed by averaging image frames 103 a to 103 i, and a third average image sequence Avg2 may be formed by averaging image frames 103 i+1 to 103 j, where j≦n. The frame averaging of these subsets of frames, which is performed by the frame averaging unit 304, is used to achieve a stabilized background image by minimizing or even removing scintillation distortions from the images.

At step 410, a check is performed as to whether an additional subset of frames is queued for processing. In the affirmative, this additional subset of frames is averaged to produce Avg3 (not shown). Otherwise, at step 412 the first average image sequence Avg0 is selected as a reference averaged sequence.

At step 414, each of the Avg1 and Avg2 sequences are differenced from the reference averaged sequence Avg0 by the image differencing unit 306. In each AIS or Avg, the scintillation is substantially minimized or even removed but moving objects remain as motion blurs. As such, this differencing or comparison of the averaged image sequences enables the determination of the location of the moving objects in the images.

As shown in FIG. 6A, the averaged image sequence Avg1 is subtracted from the reference averaged sequence Avg0 to produce an image difference, denoted as AvgX. Similarly, the averaged image sequence Avg2 is subtracted from the reference averaged sequence Avg0 to produce an image difference, denoted as AvgY. These resulting image differences AvgX and AvgY are then added together to produce image difference AvgZ. This subtracting and subsequent adding of image differences result in the determination of a mask region in the image difference AvgZ from where the moving objects need to be extracted.

At step 416, a process of building masks for isolating and extracting the moving objects is commenced. The image difference AvgZ is threshold to form a masked image, denoted hereafter as AvgZ Mask, by selecting an appropriate threshold value that improves the determination of where the moving objects are located. Subsequently, the AvgZMask image is filtered or cleaned by a series of isotropic operators, for example an image erosion operator followed by an image dilation operator, as shown in FIG. 6B.

At step 418, a threshold is also applied to the above determined image differences AvgX and AvgY to produce respective masks MaskX and MaskY, which are in turn added to each other to produce image AvgB. That is, MaskX and MaskY are combined to form the composite mask image AvgB, as shown in FIG. 6C. Subsequently at step 420, the filtered AvgZMask is combined with the composite mask image AvgB to produce a final mask, denoted as ImgTh, which indicates the region or regions to be replaced. These final masks determine substantially precise cut out regions in the images for each of the moving objects detected. The AvgZMask image may also be filtered to remove undesirable or unwanted noise, as shown in FIG. 6D.

At step 422, an image offset, denoted as ImgOffSet, is produced by overlaying the reference averaged sequence Avg0 over the final mask ImgTh. Further, the region or regions determined above to be replaced are removed or cut out from the reference averaged sequence Avg0 to produce ImgAvg or Img0, at step 424.

Finally at step 426, the image offset ImgOffSet is combined with ImgAvg to derive a final image ImgFinal, in which the determined final mask ImgTh is inserted in the evaluated masked region, as shown in FIG. 6E. As such, this masking process enables extracting averaged moving regions and replacing them with the mask of corresponding true temporal regions. Steps 416 through 426 correspond to the processing unit 308.

As the scintillation removal process involves a circular buffer of input image frames, another final image ImgFinal is similarly produced using an updated frame sequence set 1. This updated frame sequence 1 is updated to become frame sequence 2 by adding the newest image frame 103 n+1 received from the image input unit 302 or uploaded from the data storage memory 206 or the CPU memory 216, and dropping the oldest image frame 103 a. As such, the frame sequence set 2 comprises n successive frames 103 a+1-103 n+1 that are processed to produce the another final image ImgFinal.

As each of the produced final images ImgFinal may include parts that are undesirably light or dark, a LAP process may be utilized to develop desirable contrast and brightness of the pixels of these problematic parts, at step 428. The LAP unit 309 serves to improve image interpretability so as to allow an observer to see objects in the images that may otherwise be difficult or near impossible to assess in the original high-contrasted and high-brightened images. Once balanced and sharpened by the LAP unit 309, the final images ImgFinal are output by the image output unit 310, at step 430.

One exemplary embodiment of the LAP unit 420 is functionally represented by an algorithmic flow diagram illustrated in FIG. 7. Each of the final images ImgFinal is initially processed in a sharpening routine 704 which sharpens images, such as the multi-resolution un-sharp process discussed above. Next, a tiling routine 706 which involves breaking or dividing the final images ImgFinal into tiles, having about 20 to 80 pixels to a side. This tiling of images enables each tile to be processed or treated independently by predetermined subsequent processing units, such as the local histogram unit 708.

The local histogram process 708 enables an efficient redistribution of dynamic range of each of the tiles composing the final image ImgFinal, which can achieve a maximum dynamic range for each tile. Next, a mosaic routine 710 is used to reassemble or bring together all tiles the final image ImgFinal in the same arrangement they had prior to the tiling routine 706.

Finally, a global sharpening is performed on the whole regrouped final image ImgFinal by the global image process 712 so as to achieve optimum image acuity. The global sharpening process 712 emphasizes edges and other high frequency components of the final image ImgFinal, as well as differences between adjacent light and dark areas or spots. Typically, sharpening processes involve employing filters to decompose images into many frequency bands, attenuating low frequency bands, amplifying high frequency bands, and recombining modified spatial frequency bands.

While various embodiments of the present invention have been described, it will be apparent to those of skill in the art that many more embodiments and implementations are possible that are within the scope of this invention. Accordingly, the present invention is not to be restricted except in light of the attached claims and their equivalents. 

What is claimed is:
 1. A method for reducing scintillation in an image of a scene, the method comprising: receiving an input sequence of images of the scene; grouping a first set of images of the sequence of images into a first subset of images comprising a first number of images that occur in sequence within the input sequence of images of the scene; grouping a second set of images of the sequence of images into a second subset of images comprising a second number of images that occur in sequence within the input sequence of images of the scene, wherein a last image in the first subset of images occurs in time before a first image of the second subset of images, generating a set of statistically combined images comprising a combined image for the first subset of images and a combined image for the second subset of images, wherein the step of generating the combined images comprising: selecting the first subset of images as a reference subset images; selecting averaged image of the first subset of images as a reference average image; creating a set of different images, wherein each different image of the set of difference images corresponds to an averaged image that is not the reference averaged image, wherein each difference image is created by subtracting the averaged image by subtracting the average image that corresponding to each difference image; creating a composite mask that determines at least one region of the averaged reference subset of images to be replaced using at least the set of masked images and the particular masked image; and creating a composite image using at least the composite mask and the averaged reference subset of images; and outputting a composite image based at least on the set of statistically combined images.
 2. The method of claim 1 further comprising: grouping a third set of images of the sequence of images into a third subset of images, wherein a particular image of the sequence of images is included in both the second subset of images and the third subset of images.
 3. The method of claim 1 further comprising creating a particular difference image by adding together each of the difference images in the set of difference images.
 4. The method of claim 3 further comprising: selecting a threshold value; creating a set of masked images by applying the threshold value to each difference image in the set of difference images; and creating a particular masked image by applying the threshold value to the particular difference image.
 5. A non-transitory computer-readable storage medium comprising a plurality of computer-readable instructions tangibly embodied on the computer-readable storage medium, which, when executed by a data processor, provide for minimizing scintillation while retaining moving objects in an image of a scene, the plurality of instructions comprising: instructions that cause the data processor to receive an input sequence of images of the scene; instructions that cause the data processor to group a first plurality of images of the sequence of images into a first subset of images comprising a first number of images that occur in sequence within the input sequence of images of the scene; instructions that cause the data processor to group a second plurality of images of the sequence of images into a second subset of images comprising a second number of images that occur in sequence within the input sequence of images of the scene, wherein a last image in the first subset of images occurs in time before a first image of the second subset of images; instructions that cause the data processor to generate a set of averaged images comprising an averaged image for the first subset of images and an averaged image for the second subset of images, wherein the step of generating the combined images comprising: selecting the first subset of images as a reference subset images; selecting averaged image of the first subset of images as a reference average image; creating a set of different images, wherein each different image of the set of difference images corresponds to an averaged image that is not the reference averaged image, wherein each difference image is created by subtracting the averaged image by subtracting the average image that corresponding to each difference image; creating a composite mask that determines at least one region of the averaged reference subset of images to be replaced using at least the set of masked images and the particular masked image; and creating a composite image using at least the composite mask and the averaged reference subset of images; and instructions that cause the data processor to output a composite image based at least on the set of averaged images.
 6. The non-transitory computer-readable storage medium of claim 5 further comprising instructions that cause the data processor to group a third plurality of images of the sequence of images into a third subset of images, wherein a particular image of the sequence of images is included in both the second subset of images and the third subset of images.
 7. The non-transitory computer-readable storage medium of claim 5 further comprising instructions that cause the data processor to create a particular difference image by adding together each of the difference images in the set of difference images.
 8. The non-transitory computer-readable storage medium of claim 7 further comprising: instructions that cause the data processor to select a threshold value; instructions that cause the data processor to create a set of masked images by applying the threshold value to each difference image in the set of difference images; and instructions that cause the data processor to create a particular masked image by applying the threshold value to the particular difference image.
 9. An imaging system, comprising: an image detection unit; and an image acquisition unit in communication with the image detection unit, and having a memory and a processing unit coupled to the memory; wherein the processing unit is configured to execute a method for minimizing scintillation while retaining moving objects in an image of a scene, the method comprising: receiving an input sequence of images of the scene; grouping a first set of images of the sequence of images into a first subset of images comprising a first number of images that occur in sequence within the input sequence of images of the scene; grouping a second set of images of the sequence of images into a second subset of images comprising a second number of images that occur in sequence within the input sequence of images of the scene, wherein a last image in the first subset of images occurs in time before a first image of the second subset of images, generating a set of statistically combined images comprising an statistically combined image for the first subset of images and an statistically combined image for the second subset of images, wherein the step of generating the combined images comprising: selecting the first subset of images as a reference subset images; selecting averaged image of the first subset of images as a reference average image; creating a set of different images, wherein each different image of the set of difference images corresponds to an averaged image that is not the reference averaged image, wherein each difference image is created by subtracting the averaged image by subtracting the average image that corresponding to each difference image; creating a composite mask that determines at least one region of the averaged reference subset of images to be replaced using at least the set of masked images and the particular masked image; and creating a composite image using at least the composite mask and the averaged reference subset of images; and outputting a composite image based at least on the set of statistically combined images.
 10. The imaging system of claim 9 the method further comprising grouping a third set of images of the sequence of images into a third subset of images, wherein a particular image of the sequence of images is included in both the second subset of images and the third subset of images.
 11. The imaging system of claim 9 the method further comprising creating a particular difference image by adding together each of the difference images in the set of difference images.
 12. The imaging system of claim 11 the method further comprising: selecting a threshold value; creating a set of masked images by applying the threshold value to each difference image in the set of difference images; and creating a particular masked image by applying the threshold value to the particular difference image. 